Enhancement of Reverberant and Noisy Speech by Extending Its Coherence

Authors

Scott Wisdom

Thomas Powers

Les Atlas

James Pitton

Abstract

We introduce a novel speech enhancement algorithm for removing reverberation and noise from recorded speech data. Our approach centers on a single-channel minimum mean-square error log-spectral amplitude (MMSE-LSA) estimator, which applies gain coefficients in a time-frequency domain to suppress noise and reverberation. The main contribution of this paper is that the enhancement is done in a time-frequency domain that is coherent with speech signals over longer analysis durations than the short-time Fourier transform (STFT) domain. This extended coherence is gained by using a linear model of fundamental frequency variation over the analysis frame. In the multichannel case, we preprocess the data with either a minimum variance distortionless response (MVDR) beamformer or a delay-and-sum beamformer (DSB). We evaluate our algorithm on the REVERB challenge dataset. Compared to the same processing done in the STFT domain, our approach achieves significant improvement on the REVERB challenge objective metrics and, according to informal listening tests, results in fewer artifacts in the enhanced speech.
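As a rough illustration of the "gain coefficients" mentioned in the abstract, the sketch below computes the textbook log-spectral amplitude gain for a single time-frequency bin, G = xi/(1+xi) * exp(0.5 * E1(v)) with v = xi*gamma/(1+xi). This is the standard Ephraim-Malah form, assumed here for illustration; the abstract does not state which variant or which SNR estimators the authors use. The names `xi` (a priori SNR), `gamma` (a posteriori SNR), `exp1`, and `mmse_lsa_gain` are hypothetical, and the exponential integral is computed with a plain series and continued fraction so the example needs only the standard library.

```python
import math

EULER_GAMMA = 0.5772156649015329


def exp1(x, tol=1e-12, max_iter=200):
    """Exponential integral E1(x) for real x > 0 (illustrative helper)."""
    if x <= 0.0:
        raise ValueError("x must be positive")
    if x <= 1.0:
        # Power series: E1(x) = -gamma - ln(x) - sum_{k>=1} (-x)^k / (k * k!)
        total = -EULER_GAMMA - math.log(x)
        term = 1.0  # running value of (-x)^k / k!
        for k in range(1, max_iter + 1):
            term *= -x / k
            delta = -term / k
            total += delta
            if abs(delta) < tol * abs(total):
                break
        return total
    # Continued fraction evaluated with the modified Lentz method.
    b = x + 1.0
    c = 1e300
    d = 1.0 / b
    h = d
    for i in range(1, max_iter + 1):
        a = -float(i * i)
        b += 2.0
        d = 1.0 / (a * d + b)
        c = b + a / c
        delta = c * d
        h *= delta
        if abs(delta - 1.0) < tol:
            break
    return h * math.exp(-x)


def mmse_lsa_gain(xi, gamma):
    """Textbook MMSE-LSA gain for one time-frequency bin.

    xi    : a priori SNR (assumed to be estimated elsewhere)
    gamma : a posteriori SNR (assumed to be estimated elsewhere)
    """
    if xi <= 0.0 or gamma <= 0.0:
        raise ValueError("xi and gamma must be positive")
    wiener = xi / (1.0 + xi)  # Wiener-like factor
    v = wiener * gamma
    return wiener * math.exp(0.5 * exp1(v))


# Usage: scale the noisy magnitude of one bin by the gain.
noisy_magnitude = 0.8
gain = mmse_lsa_gain(xi=1.0, gamma=2.0)
enhanced_magnitude = gain * noisy_magnitude
print(gain, enhanced_magnitude)
```

The gain always sits at or above the Wiener-like factor xi/(1+xi), because E1(v) is positive for v > 0. In the approach described above the same kind of gain would be applied in the extended-coherence domain rather than per STFT bin; that transform is not sketched here.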

Similar resources

Enhancement and Recognition of Reverberant and Noisy Speech by Extending Its Coherence

Most speech enhancement algorithms make use of the short-time Fourier transform (STFT), which is a simple and flexible time-frequency decomposition that estimates the short-time spectrum of a signal. However, the duration of short STFT frames is inherently limited by the nonstationarity of speech signals. The main contribution of this paper is a demonstration of speech enhancement and automati...

A New Method for Speech Enhancement Based on Incoherent Model Learning in Wavelet Transform Domain

The quality of a speech signal is significantly reduced in the presence of environmental noise, which leads to imperfect performance of hearing aid devices, automatic speech recognition systems, and mobile phones. In this paper, single-channel speech enhancement of signals corrupted by additive noise is considered. A dictionary-based algorithm is proposed to train the speech...

On the role of missing data imputation and NMF feature enhancement in building synthetic voices using reverberant speech

In this paper, we study the role of a recently proposed feature enhancement technique in building HMM-based synthetic voices using reverberant speech data. The feature enhancement technique studied combines the advantages of missing data imputation and non-negative matrix factorization (NMF) based methods in cleaning up the reverberant features. Speaker adaptation of a clean average voice using...

The NTU-ADSC Systems for Reverberation Challenge 2014

This paper describes our speech enhancement and recognition systems developed for the Reverberation Challenge 2014. To enhance the noisy and reverberant speech for human listening, besides using conventional methods such as delay and sum beamformer and late reverberation reduction by spectral subtraction, we also studied a novel learning-based speech enhancement. Specifically, we train deep neu...

Bayesian Feature Enhancement for ASR of Noisy Reverberant Real-World Data

In this contribution we investigate the effectiveness of Bayesian feature enhancement (BFE) on a medium-sized recognition task containing real-world recordings of noisy reverberant speech. BFE employs a very coarse model of the acoustic impulse response (AIR) from the source to the microphone, which has been shown to be effective if the speech to be recognized has been generated by artificially...

Publication date: 2014
